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\  SUMMARY 

'''An  exact  least-squares  lattice  algorithm  for  channel 
equalisation  is  derived  without  using  an  explicit  time  update 
for  the  equaliser  or  predictor  coefficients.  This  avoids  the 
need  for  auxiliary  vectors  and  scalars  within  the  analysis  and 
facilitates  a  theoretical  comparison  with  the  corresponding 
Fast  Kalman  algorithm  which  is  also  derived.  Both  algorithms 
incorporate  a  very  general  form  of  statistical  estimator  which 
includes  the  growing  or  growing-fading  memory  estimators  and 
the  sliding  window  estimator  as  special  cases. 
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1  INTRODUCTION 


The  problem  of  adaptive  channel  equalisation  has  received  considerable 
attention  in  recent  signal  processing  and  communications  literature  and  a  number 
of  different  equalisation  algorithms  have  been  proposed.  Channel  equalisation 
is  the  essential  mechanism,  for  example,  in  noise  cancellation  systems  ^  1  J 
where  the  objective  is  to  remove  from  the  signal  in  the  primary  channel  any 
components  which  are  correlated  with  a  secondary  or  reference  channel  signal. 

In  the  case  of  discrete  digital  signals  this  can  be  achieved  by  passing  the 
reference  signal  through  an  adaptive  linear  filter  whose  coefficients  are 
adjusted  to  minimise  the  mean-square  difference  between  the  filter  output  and 
the  primary  signal.  It  can  be  shown  that  the  coefficients,  when  optimised  in 
this  way,  satisfy  the  Wiener-Hopf  equation  and  hence  correspond  exactly  to  those 
of  the  optimum  Wiener  filter  associated  with  the  process  (assumed  to  be 
statistically  stationary).  In  some  applications  the  coefficients  are  only 
adjusted  during  an  initial  'training  period*  and  are  subsequently  held  constant. 
In  other  situations  they  are  updated  at  regular  intervals  and  the  filter  may  be 
able  to  track  gradual  variations  in  the  channel  statistics.  In  this  paper  we 
only  consider  the  case  where  the  coefficients  are  updated  every  sample  time. 

A  closely  related  problem  is  that  of  adaptive  linear  prediction  which  may 
be  regarded  as  a  special  case  of  channel  equalisation  in  which  the  reference 
signal  is  simply  the  primary  signal  delayed  by  one  sample  interval.  This  can 
also  be  achieved  using  a  linear  filter  as  described  above  and  the  net  effect  is 
to  decorrelate  successive  samples  of  the  primary  signal  leaving,  ideally,  a 
white  noise  sequence.  The  one-step-ahead  linear  predictor  finds  use  in  a  wide 
range  of  applications  such  as  speech  analysis  system  identification^^  and 

maximum  entropy  spectral  analysis  ^ 
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The  various  algorithms  vhich  are  commonly  used  for  adaptive  channel 

equalisation  may  be  separated  into  four  main  classes  according  to  whether  they 

are  based  on  a  transversal  or  lattice-type  filter  structure  and  whether  the 

coefficients  are  computed  exactly  or  evaluated  using  gradient  or  stochastic 
r  3  ] 

approximation  methods. 

In  the  context  of  transversal  filter  structures,  stochastic  or  gradient 
techniques  such  as  the  Widrow  "Least  Mean  Square"  (LMS)  algorithm  ^  ‘  ‘  are  widely 
used.  They  are  very  efficient  computationally,  requiring  a  number  of  arithmetic 
operations  (multiplications  or  additions  per  sample  time)  at  most  proportional 
to  L,  the  number  of  delay  stages  in  the  filter.  However,  their  rate  of 
convergence,  which  depends  on  detailed  statistical  properties  of  the  data,  can 
be  slow  and  the  coefficients  tend  to  fluctuate  about  their  converged  mean  values 
leading  to  misadjustment  noise  and  coefficient  error.  The  rate  at  which  the 
filter  can  track  changes  in  the  statistical  environment  is  therefore  limited. 

This  type  of  convergence  problem  is  not  encountered  if  the  transversal 

filter  coefficients  are  determined  using  a  direct  -method  such  as  the  "Recursive 

« 

Least  Squares"  algorithm,  which  can  be  related  to  the  theory  of  Kalman  filtering 
and  has  been  applied  to  the  channel  equalisation  problem  by  Godard  ^  During 
each  sample  interval  the  inverse  of  the  estimated  covariance  matrix  is  evaluated 
by  applying  an  exact  update  technique.  The  inverse  is  then  used  to  derive 
filter  coefficients  which  are  optimal  in  the  sense  that  they  satisfy  the 
Wiener-Hopf  equation  with  respect  to  the  current  covariance  matrix  and  cross¬ 
correlation  vector  estimates.  By  defining  the  estimators  for  these  quantities 
to  have  finite  memory  length  the  algorithm  can  be  modified  to  track  temporal 
variations  in  the  statistical  environment.  Because  of  the  matrix  computations 

which  are  involved  the  number  of  operations  required  to  carry  out  the  Recursive 
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Least  Squares  algorithm  is  proportional  to  L  and,  being  more  expensive 


computationally,  it  is  less  widely  used  than  algorithms  of  the  gradient 
type.  However,  Morf  and  co-workers  and  Falconer  and  Ljung  ^  ^  have 
shown  how  exact  least  squares  linear  prediction  and  channel  equalisation 
can  be  performed  using  a  number  of  arithmetic  operations  proportional  to 
L  by  means  of  the  "Fast  Kalman"  algorithm,  which  is  very  powerful  but 
does  not  seem  to  have  received  much  attention  in  practice. 

Much  of  the  attention  which  has  recently  been  devoted  to  adaptive 
channel  equalisation  concerns  the  use  of  lattice-type  filter  structures 

•  i 

which  were  suggested  by  Itakura  and  Saito  and  can  be  shown  to  have  very 
good  stability  properties  based  on  the  theory  of  orthogonal  polynomials1  m  1 
An  adaptive  lattice  filter  comprising  N  stages  can  be  used  to  carry  out 
an  efficient  Nth  order  linear  prediction  for  statistically  stationary 
processes  by  adjusting  the  reflection  coefficient  at  each  stage  (using 
direct  or  gradient  techniques)  to  ensure  that  the  energy  of  the  residual 
signal  from  that  stage  is  minimised.  In  effect  the  filter  is  then  being 
used  to  implement  the  Levinson-Durbin  (LD)  recursion  algorithm 
a  generalised  version  of  which  is  discussed  in  detail  in  Section  1. 
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Griffiths  has  pointed  out  that  the  set  of  backward  residuals  derived  from 

an  adaptive  lattice  filter  could  be  used  instead  of  the  delayed  signal  values 

in  a  transversal  filter  when  applying  gradient  techniques  to  the  channel 
r  I?  1 

equalisation  problem  1  .  The  fact  that  these  residuals  are  mutually  uncorre¬ 

lated  suggests  that  the  equaliser  convergence  rate  can  be  greatly  improved  in 
general.  However,  the  use  of  a  gradient-type  equaliser  algorithm  will  still 
lead  to  problems  of  misadjustment  noise  and  coefficient  error  and  if  these  are 
to  be  kept  at  an  acceptable  level  it  may  not  be  possible  in  practice  to  make  use 
of  the  improved  convergence  rate  which  is  offered.  Furthermore,  the  behaviour 
of  the  lattice  filter  in  the  initial  convergence  phase  when  the  statistics  can 
not  be  stationary  has  not  been  fully  analysed  in  this  context. 

[  13  1 

Morf  and  Lee  have  shown  that  the  entire  L-stage  least-squares  channel 

equalisation  process  can  be  carried  out  exactly  using  an  N-stage  lattice-type 

structure  (where  L  -  N+l)  by  generalising  the  underlying  LD  recursion  algorithm 

to  include  the  primary  channel  signal  as  well  as  the  reference  signal  and  to 

take  account  of  non-stationary  statistics.  This  approach  is  extremely  powerful 

since  it  leads  to  computationally  efficient  algorithms  (the  number  of  operations 

is  proportional  to  L)  and  produces  the  optimum  filter  coefficients  at  every 

sample  time.  Moreover,  it  is  based  on  a  filter  structure  which  has  good 

stability  properties  (at  least  for  stationary  statistics)  and  possesses  the 

desirable  property  that  no  stage  of  the  filter  affects  the  operation  of  any 

previous  stages.  Morf  and  Lee  have  produced  a  number  of  specific  algorithms 

f  14  1 

based  on  this  theory  and  Satorius  and  Pack  have  successfully  applied  one 

of  them  to  achieve  effective  equalisation  with  simulated  data  on  a  computer. 

In  order  to  develop  a  complete  least-squares  lattice  algorithm  it  is 
necessary  to  define  a  suitable  estimator  for  the  various  statistical  quantities 
which  are  involved  and  to  include  the  detailed  time  evolution  of  these 
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and 
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estimates  within  the  overall  lattice  structure.  Morf  and  Lee1 
Satorius  and  Pack  ^  ^  ^  have  tackled  this  problem  by  introducing  a  number  of 
auxiliary  vectors  and  scalars  into  the  problem  and  carrying  out  a  lengthy 
analysis  to  derive  explicit  updates  in  time  only  for  the  corresponding  trans¬ 
versal  filter  (equaliser  and  predictor)  coefficients.  Satorius  and  Pack 
even  make  use  of  the  Woodbury  inversion  lemma  in  their  analysis  ^  ^  and 

although  this  move  can  be  avoided  for  the  purpose  of  deriving  their  final 
result  it  does  emphasise  the  fact  that  the  use  of  auxiliary  vectors  and 
scalars  is  closely  related  to  the  Kalman-Godard  approach  in  which  the 
equaliser  coefficients  themselves  are  determined  using  a  direct  time  update 
technique  ^  ^  K  It  would  appear  that  having  solved  the  Wiener-Hopf  equation 
by  means  of  the  efficient  time  and  order  recursion  which  is  incorporated  in 
the  lattice  structure,  the  solution  is  effectively  being  derived  again  in 
order  to  provide  time  updates  for  the  relevant  statistical  estimators.  This 
makes  the  least-squares  lattice  approach  appear  much  more  complicated  than  it 
really  is  although  the  final  algorithm  is  the  same  whichever  analytic  route 
is  chosen. 

The  purpose  of  this  paper  is  threefold: 

(a)  To  demonstrate  how  the  necessary  updates  can  be  derived  much  more  readily 
by  means  of  a  very  direct  analysis  which  does  not  involve  an  explicit 
time  update  for  the  corresponding  predictor  or  equaliser  coefficients  and 
leads  to  more  efficient  algorithms  than  those  employing  auxiliary  vectors 
and  scalars; 

(b)  To  develop  a  detailed  algorithm  based  on  a  general  estimator  for  the 
relevant  statistical  quantities.  This  estimator  includes  as  a  special 
case  the  growing  memory  (which  is  used  in  the  Recursive  Least  Squares 
algorithm)  as  well  as  the  growing-fading  memory  (exponential  window) 
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and  the  sliding  window,  both  of  which  provide  an  important  time  tracking 
capability. 

(c)  To  compare  the  efficiency  and  form  of  these  algorithms  with  their  Fast 
Kalman  equivalents. 

In  Section  2  we  present  a  straightforward  derivation  of  the  generalised 
LD  recursion  algorithm  and  show  how  it  leads  naturally  to  the  use  of  lattice 
structures  for  linear  prediction  (Section  2a)  and  channel  equalisation 
(Section  2b).  The  analysis  is  expressed  in  terms  of  ensemble  averages  and 
serves  to  provide  a  clear  view  of  the  fundamental  way  in  which  the  lattice 
implementation  is  related  to  the  basic  structure  of  the  algorithm.  Detailed 
formulae  concerning  the  estimation  and  update  of  the  relevant  statistical 
quantities  are  not  necessary  for  this  purpose  and  bo  they  are  discussed 
separately  in  Section  3. 

In  Section  3  a  specific  estimator  for  the  various  statistical  quantities 
is  introduced  for  the  first  time, and  the  necessary  update  formulae  to  generate 
complete  linear  prediction  (Section  3a)  and  channel  equalisation  (Section  3b) 
algorithms  are  derived  in  a  very  simple  and  direct  way.  The  detailed  channel 
equalisation  algorithm  is  given  in  Appendix  1. 

The  use  of  auxiliary  vectors  and  scalars  to  derive  the  necessary  statistical 
update  formulae  is  discussed  in  detail  in  Section  4a  and  the  detailed  equaliser 
algorithm  which  results  is  presented  in  Appendix  2.  The  strong  resemblance 
between  the  auxiliary  vector  analysis  and  the  use  of  the  Kalman-Godard 
technique  to  solve  the  least-squares  channel  equalisation  problem  directly  is 
highlighted  in  Section  4b  where  the  Fast  Kalman  algorithm  is  derived  quite 
simply  using  some  of  the  auxiliary  vector  formulae  from  Section  4a.  The  Fast 
Kalman  algorithm  is  given  in  detail  in  Appendix  3. 
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2  GENERAL  LATTICE  AND  EQUALISER  STRUCTURE 


(a)  The  NL  Order  Linear  Prediction  Lattice 

For  the  purpose  of  explaining  the  underlying  structure  of  lattice 
equalisation  algorithms  it  is  convenient  to  consider  first  a  general  p^  order 
linear  prediction  problem  which  may  be  stated  as  follows.  Given  a  data  sequence 
{x(n) |n  “0,  1,  ...}  find  the  set  of  time-dependent  coefficients  or  "predictors" 
(ap(k;n) |k  *  1,  2,  ...p}  which  minimises  the  quantity 

E  (n)  -  <e2(n)>  (2.1) 

P  P 


where  e  (n)  »  x(n)  -  x  (n)  is  the  forward  residual  and  x  (n)  is  given  by 
P  P  P 


xp(n) 


r 

-  ap(k;n)x(n-k) 


The  angle  brackets  in  equation  (2.1)  denote  a  general  ensemble  average  over  the 
non-stationary  process  (x(n)}  which  is  assumed  to  be  real  and  scalar.  The 
residual  ep(n)  may  be  written  more  concisely  as 

ep(n)  ■  Ap(n)Xp(n)  ,  (2.3) 

where  we  have  defined  the  vectors 


Xp(n)  ■  ^x(n)  x(n-l)  ...x(n-p)^  ,  (2.4) 

Ap(n)  -  ^1  Ap(n)^  ,  (2.5) 

Ap(n)  “  ^ap(l;n)  ap(2;n)  ...ap(p;n)^  ,  (2.6) 

and  the  superscript  T  denotes  matrix  transposition. 

If  the  forward  predictors  are  regarded  as  non-statistical  variables, 
independent  of  (x(n)},  equation  (2.3)  allows  the  forward  residual  energy  Ep(n) 
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Q  (n)  =  <x(n)X  ,(n-l)>  . 

P  pl 


(2.14) 


An  alternative  partitioning  of  R  (n)  results  from  writing  X  (n)  in  the  form 


;(n>  -  (Xp-l(n)  :x<n"P>)- 


(2.15) 


from  which  it  follows  that 


where 


Rp(n) 


Vp(n) 


vp(n) 


v  (n)  =  <x  (n-p)  >  , 


(2.16) 


V  (n)  -  <x(n-p)X  ,(n)> 


(2.17) 


Equations  (2.5),  (2.7)  and  (2.12)  allow  the  energy  Ep(n)  to  be  written  as 

Ep(n)  -  qp(n)  +  2Ap(n)Qp(n)  +  Ap(n>Rp-l(n'1)Ap(n)  ,  (2.18) 

T 

and  minimisation  of  Ep(n)  with  respect  to  Ap(n)  yields  the  normal  (Yule-Walker) 
equations  for  the  forward  predictors: 


R  1 (n-l)A(n)  -  -Q(n) 

pi  P  P 


(2.19) 


Substitution  of  (2.19)  into  (2.18)  gives  the  minimum  energy,  or  mean-square 


error, 


£  (n)  A  min  (E  (n)) 
P  P 


qp(n)  +  Ap(n)Qp(n) 


(2.20) 


and  (2.19)  and  (2.20)  can  be  combined  in  the  form 


•  -V  j 

..  .  ..  *  j 


hfej' 
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(2.21) 


/*”>  \ 

Rp(n)Ap(n)  -  (  0p  I  • 


where  0  is 
P 


the  zero  p-vector. 


If  equation  (2.19)  were  solved  by  updating  the  covariance  matrix  every 
sample-time  and  inverting  it  directly  using  a  general  algorithm  such  as  the 

3 

Gaussian  elimination  procedure  a  large  number  of  arithmetic  operations  (0(p  )) 

would  be  required.  Alternatively  the  inverse  covariance  matrix  itself  could  be 

updated  every  sample  time  (the  Kalman-Godard  algorithm  ^  b in  which  case  the 

2 

number  of  operations  would  be  reduced  to  0(p  ).  It  is  well  known,  however,  that 
in  situations  where  the  statistics  are  stationary  and  the  covariance  matrix 
is  therefore  of  Toeplitz  form  the  number  of  operations  required  can  be  reduced 
to  0(p)  by  updating  the  solution  every  sample  time  using  a  recursive  procedure 
due  to  Levinson  and  Durbin  (LD)  t 10  1  l 11  1#  ^  point  which  does  not  seem  to  be 

so  widely  appreciated  is  that,  by  making  use  of  the  time-shift  properties  of 
Rp(n)  it  is  possible  to  generalise  this  procedure  so  that  it  applies  to  the 
analysis  of  time  series  for  which  the  statistics  are  not  stationary.  The 
generalised  LD  algorithm  will  now  be  derived. 


It  is  necessary  to  consider  in  addition  the  pth  order  backward  prediction 
problem,  associated  with  the  minimisation  of  the  quantity 

G  (n)  -  <r2(n)>  ,  (2.22) 

P  P 


where  the  backward  residual  r  (n)  is  defined  as 

P 


rp(n)  -  Bp(n)Xp(n)  , 


B*(n)  -  (B*(n),  1) 

P  P 


and 


j(n)  -  ^ bp(0;n)  bp(l;n)  . . .bp(p-l;n)^ 


is  the  vector  of  backward  predictors. 


(2.23) 

(2.24) 

(2.25) 
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Minimisation  of  the  backward  residual  energy  G^(n)  with  respect  to  the 
backward  predictors  at  time  n  yields  the  normal  equations 


R  i (n)B  (n)  »  -V  (n) 

Pi  P  P 


(2.26) 


where  the  partitioning  in  (2.16)  has  been  used.  In  a  fashion  similar  to  that 
used  in  forward  prediction,  the  normal  equations  may  be  expressed  in  the  form 


R  (n)B  (n) 
P  P 


(2.27) 


where  er(n)  is  the  minimum  backward  residual  energy. 
P 


The  generalised  LD  procedure  assumes  that  the  pth-order  forward 
prediction  problem  has'  been  solved  for  time  n,  and  that  the  corresponding 
backward  prediction  problem  has  been  solved  at  time  (n-1);  that  is  to  say 
that  Ap(n),  Bp (n-1),  £p(n),  e*(n-l)  are  known  and  satisfy  equations  (2.21) 
and  (2.27)  at  the  appropriate  times.  The  solution  of  the  (p+l)th-order 
forward  and  backward  prediction  problems  at  time  n  may  then  be  written  in  the 
form 


(2.28a) 


(2.28b) 


where  Kp+^(n)  and  Lp+1(n)  are  the  (p+1) th-order  forward  and  backward 
"reflection",  or  "PARCOR"  coefficients.  This  recursive  procedure  may  be 
verified  by  pre-multiplying  equations  (2.28a)  and  (2.28b)  by  Rp+^(n), 

_T  T 

partitioned  according  to  equation  (2.16)  when  multiplying  (A^(n)  0)  ,  and 

_T  T 

according  to  equation  (2.12)  when  multiplying  (0  Bp(n-l))  .  This  yields  the 


relations 


(2.29a) 


where  the  definitions  (2.3)  and  (2.23)  of  the  forward  and  backward  residuals 

have  been  used,  k  (n)  thus  has  the  form  of  a  partial  correlation  function 
P 

between  the  forward  and  backward  residuals,  and  L  (n),  K  (n)  (equations  (2.29b) 

P  P 

and  (2.30b))  may  be  interpreted  as  partial  correlation  coefficients. 
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Equations  (2.29),  (2.30)  and  (2.32)  provide  order  recursions  for  the 


forward  and  backward  residual  energies: 

k2(n) 


Vl(n)  "  ep(n) 


tr(n-l) 

P 


Vi(n)  m  s(n_1)  "  “! 


k2(n) 


ee(n) 


(2.34a) 


(2.34b) 


Recursions  for  the  residuals  themselves  are  found  by  pre-multiplying  equations 
(2.28)  by  xj+1(n)  and  using  (2.3)  and  (2.23),  which  yields 


Vi(n)  *  eP(n)  +  Vi(n)rp(n'1)  ’  (eo(n)  - x(n) 


and 


rP+l(n)  “  Lp+l(n)Cp(n)  +  rp(n_1) 


^eQ(n)  -  x(n)^ 

,  ^rQ(n)  -  x(n)^  . 


(2.35a) 


(2.35b) 


The  order  iterations  of  equations  (2.35)  may  be  realised  in  the  "lattice"  or 

til 

"ladder"  network  shown  in  figure  1;  they  are  used  to  carry  out  N  -order  linear 

prediction  by  solving  the  problem  recursively  for  all  orders p  (0  <  p  <  N) .  It 

can  be  seen  that  the  solution  of  the  linear  prediction  problem  has  been  effected 

using  the  "local"  lattice  correlations  k^(n)  rather  than  the  data  covariance 

values,  and  the  reflection  coefficients  L  (n) ,  K  (n)  instead  of  the  usual 

P  P 

transversal  predictor  coefficients  Ap(n),  Bp(n).  A  knowledge  of  the  kp(n)  per¬ 
mits  a  complete  evaluation  of  residuals,  reflection  coefficients  and  predictor 
values  using  equations  (2.35),  (2.34),  (2.29b),  (2.30b)  and  (2.28). 

We  note  the  following  points  arising  from  the  preceding  analysis. 

(i)  For  stationary  statistics  the  procedure  reduces  to  the  conventional  LD 

recursion  associated  with  Toeplitz  covariance  matrices.  In  this  instance, 
all  quantities  become  time-  (n)-independent,  backward  energies  equal 


13 


forward  energies,  and  backward  reflection  coefficients  equal  forward 
reflection  coefficients.  Equality  for  the  predictors  is  also  attained 
in  the  sense 

ap(k)  -  bp(p-k)  ,  k  -  1,  2,  ....  p.  (2.36) 

It  is  clear  that  the  more  general  extended  LD  recursion  is  well-suited  for 
situations  where  it  is  difficult  to  realise  stationarity  -  for  example, 
in  speech  analysis,  where  local  statistics  may  vary  significantly  with 
time,  or  identification  of  linear  systems  with  time-dependent  parameters. 
The  problem  of  tracking  time-varying  characteristics,  addressed  in  Section 
3,  is  one  for  which  the  general  treatment  above  is  appropriate. 

(ii)  The  backward  residuals  are  orthogonal.  This  is  easy  to  demonstrate  by 
evaluating  the  quantity 


h  (n)  A  <r  (n)r  (n) >  .  (2.37) 

mp  *  m  p 

Using  equation  (2.23),  this  may  be  written 

h(n)  -  B*(n)  <X(n)X*(n)>  B  (n)  (2.38) 

mp  m  m  p  p 

Equations  (2.8)  and  (2.27)  show  that  h  (n)  is  equal  to  er(n)  for  m  ■  p, 

mp  p 

and  zero  for  p  <  m.  For  p  >  m  equation  (2.27)  may  be  employed  in  trans¬ 
pose  form  to  prove  hnp(n)  zero  here  also.  Hence,  we  have  the  orthogonality 
relation 


<r  (n)r  (n)>  -  S_e*(n)  , 

m  p  mp  p 


II,  m  -  p 
0,  m  #  p 


(2.39) 
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It  is  this  property  of  the  lattice  filter  which  Griffiths  exploits  in 
order  to  accelerate  the  convergence  of  noise-cancelling  filters  which 

t  12  1 

operate  using  gradient  techniques  such  as  the  LMS  algorithm 
(b)  The  Lth  Order  Lattice  Equaliser 

The  general  p^  order  channel  equalisation  problem  amounts  to  finding  the 
set  of  time-dependent  equaliser  coefficients  (fp(k;n)|k  “0,  1,  ...  p}  which 
minimise  the  quantity 

E  (n)  A  <e2(n)>  ,  (2.40) 

p  -  p 

where  the  residual  signal 

P 

e  (n)  -  a(n)  +  V  f  (k;n)x(n-k) 

P  fco  p 

is  the  difference  between  the  primary  signal  (a(n)l  and  the  equaliser  output 
obtained  by  passing  the  reference  signal  (x(n)}  through  the  channel  equalisa¬ 
tion  filter,  e  (n)  is,  in  fact,  the  desired  output  from  a  noise  cancellation 
P 

system.  This  problem  may  be  solved  in  a  recursive  manner  analogous  to  that 
outlined  above  for  linear  prediction. 

In  order  to 

Fj(n) 

where 

F(n) 

and  the  extended 

xj(n) 


rationalise  the  notation  we  define  the  vector 


‘  Fp<n))  • 


1 


f  (0;n)  f  (l;n)  . 
P  P 


fp(p 


;n)^  , 


(2.42) 


(2.43) 


data  vector 

f 

a(n) 


-  ^a(n)j  xj(n)^  . 


(2.44) 
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The  residual  signal  e  (n)  may  then  be  written  in  the  form 

P 


«p(r0  "  fp(n)Vn)  • 


(2.45) 


and  the  error  criterion  in  (2.40)  becomes 


E  (n)  -  r%)R(n)F  (n) 

P  P  P  P 


(2.46) 


where  an  extended  covariance  matrix  R  (n)  has  been  defined  as 

P 


R  (n)  -  <X  (n)X  (n)> 

P  P  P 


(2.47) 


It  is  easily  verified  that  Rp(n)  has  the  partitioning 


Rp(n) 


Vi(n> 

|vp<n> 

1 

1 

VT(n) 

P 

!vp(n) 

(2.48) 


where 


and 


v  (n) 
P 


<x  (n-p) > 


(2.49) 


Vp(n)  ■  <x(n-p)Xp_1(n)> 

It  may  also  be  partitioned  in  the  form 

q„(n)j  Qp(°) 


Vn> 


/.V”4...vrL\ 

'  Q_(n)!  R  (n) 

P  P 


(2.50) 


(.2.51) 


where  qp(n)  is  the  mean-square  scalar  given  by 


qp(n)  -  <a  (n)> 


(2.52) 


Qp(n)  is  a  cross-correlation  vector  of  the  form 


Q  (n)  -  <a(n)X  (n)> 

P  P 


(2.53) 
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and  Rp(n)  is  the  covariance  matrix  defined  in  equation  (2.8). 

Use  of  equations  (2.51)  and  (2.46)  permits  the  minimisation  of  f  (n)  with 

P 

respect  to  F  (n),  giving  the  time-dependent  Wiener-Hopf  equation, 

P 


or 


R  (n)F  (n)  -  -Q  (n) 

P  P  P 


R  (n)F  (n) 
P  P 


(::) 


(2.54) 


(2.55) 


where 


c  (n)  -  min(E  (n) ) 

P  P 


(2.56) 


The  extension  of  the  linear  prediction  lattice  filter  to  a  lattice  equaliser 
may  be  deduced  from  the  equation 


Fp(„) 


/Vi<n>\  ^Cn)  /  0  \ 

Vo/  %<">  VV”>/ 


(2.57) 


which  relates  the  equaliser  weights  to  the  backward  predictor  coefficients  and 

may  be  applied  recursively.  Equation  (2.57)  may  be  verified  by  pre-mult ip lying 

both  sides  by  R  (n)  and  using  (2.55)  and  (2.48).  k  (n)  is  a  partial  cross- 
P  P 

correlation  function  given  by  the  expression 


k  (n)  -  \F(n)F  .  (n) 

P  P  Pi 


(2.58) 


It  may  also  be  written  in  the  form 


kp(n) 


^0  Ij(n)^  Rp(n)  ^(n)  C 


(2.59) 


by  making  use  of  equation  (2.48)  and  (2.58)  and  it  then  follows  from  (2.47) 
that 


kp(n)  -  < 


^0  Bj(n)j  Xp(n)xJ(n)  ^J_j(n)  0^ 


<r  (n)e  .(n)> 
p  p-1 


(2.60) 
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which  takes  the  fora  of  a  partial  cross-correlation  function  between  backward  and 
equaliser  residuals. 

-T 

Pre-multiplication  of  equation  (2.57)  by  X^(n)  gives  the  order  recursion 
for  the  equaliser  residuals 

k  (n)  /_  \ 

e  (n)  «  e  .  (n)  -  —*7- — r  (n)  (  e  ,  (n)  ■  a(n)J  (2.61) 

p  P_1  ej(n)  P  V  _1  / 

and  it  follows  immediately  that  the  Nth-order  linear  prediction  lattice  filter 
may  be  extended  to  apply  to  the  L  stage  channel  equalisation/noise-cancellation 
problem  (where  L  «■  N+l)  by  means  of  the  configuration  illustrated  in  figure  2 
(the  equaliser  output  e^n)  ”  a(n)  is  not  specifically  shown). 

A  point  worth  noting  at  this  stage  is  that  the  lattice-type  algorithm 
solves  the  Lth  order  equalisation  problem  by  progressively  solving  the  equiva¬ 
lent  problem  for  all  orders  p  <  L  and  making  use  of  recursions  in  time  and 
order.  Adding  extra  stages  to  the  filter  increases  the  order  of  problem 
which  may  be  solved  without  affecting  the  operation  of  the  previous  stages  in 
any  way.  This  is  to  be  contrasted  with  the  iterative  least-squares  methods 
(eg  the  Kalman-Godard  algorithm)  which  solve  the  problem  for  order  L  only  by 
updating  the  complete  coefficient  vector  solution  in  time  only. 

3  REALISATION  OF  AVERAGES 

(a)  Estimator  and  Linear  Prediction 

The  lattice  equaliser  recursions  of  Section  2  constitute  a  complete 
algorithm  for  data  processing  once  suitable  estimators  for  the  partial  correla¬ 
tion  functions  k  (n)  and  k  (n)  have  been  defined.  However,  as  the  k  (n)  and 
P  P  P 

kp(n)  (equations  (2.33)  and  (2.60))  take  the  form  of  correlations  between 
various  residuals  -  that  is, 
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filtered  data  -  any  explicit  estimator  for  these  quantities  will  necessarily 


render  A  (n) ,  B  (n)  and  F  (n)  functions  of  the  data  sequence  and  hence 
P  P  P 

statistical  variables,  violating  .  our  original  assumption  of  deterministic 
coefficients  and  predictors.  A  decoupling  of  the  coefficients  and  predictors 
from  the  data  is  therefore  essential  and  so  an  explicit  average  for  the  resi¬ 
dual  energies  will  be  defined  as  follows.  Corresponding  estimators  for  the 
partial  correlation  functions  will  emerge  as  a  result. 

The  general  estimator  for  the  forward  energy  is  chosen  here  to  be 
n 

E  (n)  -  /I  wn'q  e*(q,n)  ,  (0  <  w  <  1)  (3.1) 

q“n-M+l 

where  the  forward  residual  has  been  re-defined  to  be  a  function  of  two  time 
epochs  n  and  q, 

e  (q,n)  -  A^(n)X  (q)  (x(n)  -  0,  n  <  0)  .  (3.2) 

r  r  r 

This  separation  of  data  and  filter  evolution  permits  time  averaging  over  data 
while  the  filter  parameters  are  held  fixed.  Equations  (3.1)  and  (3.2)  then 
yield  the  expression 

E  (n)  »  A^(n)R  (n)A  (n)  ,  (3.3) 

P  P  P  P 

where  the  estimator  for  the  covariance  matrix  tales  the  form 
n 

R  (n)  -  wn~qX  (q)Xp(q)  ,  (3. A) 

q«n-M+l 

and  possesses  the  simple  time  update  recursion, 

R  (n)  -  wR  (n-1)  +  X  (n)XT(n)  -  w”x  (n-M)XT(n-M)  .  (3.5) 

P  P  P  P  P  P 


19 


The  general  estimator  described  in  equations  (3.1)  to  (3.4)  includes  a  number 
of  important  special  cases  such  as  the  growing  or  growing-fading  memory  esti¬ 
mator  and  the  sliding  window  estimator  whose  specific  form  and  relative  merits 
will  be  discussed  in  Section  5.  It  is  likely  that  the  final  algorithm  would 
be  applied  using  one  of  these  more  specific  estimators,  although  the  analysis 
which  follows  will  be  carried  out  using  the  general  form. 

It  is  easily  verified  that  the  time-shift  properties  (2.12)  and  (2.16) 
of  the  covariance  matrix  Rp(n)  also  apply  to  Rp(n);  together  with  the 
correspondence  between  equations  (3.3)  and  (2.7)  for  the  forward  energy,  this 
allows  the  complete  lattice  structure  described  in  Section  2  to  be  deployed 
with  the  specific  estimators  described  above.  For  this  reason,  we  shall 
henceforth  omit  the  caret  over  estimated  quantities.  The  only  alterations  to 
the  lattice  formalism  appear  in  the  definitions  and  order  recursions  of 
residuals.  By  analogy  with  equation  (3.2),  the  backward  residual  is  now 
defined  by 

rp(q,n)  -  Bp(n)Xp(q)  ,  (3.6) 

and  the  residual  recursions  are  obtained  upon  pre-multiplying  equations  (2.28) 
by  Xp(q): 

e  .(q,n)  -  e  (q,n)  *K  .(n)r  (q-1,  n-1)  (3.7a) 

p+l  p  p+l  p 

rp+l<q,n)  *  Lp+1(n)ep(q,n)  +  rp(q-l,  n-1)  .  (3.7b) 

It  will  become  clear  that  only  a  limited  number  of  these  residuals  will  need  to 
be  retained  for  the  final  algorithm. 

The  recursion  for  the  partial  correlation  function  k  (n)  is  now  easily 

P 

derived.  We  make  use  of  the  equation 
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kp(n) 


Vn'l))  Vi(n> 


(Xp(n)  0  J 


(2.32) 


and  difference  the  forward  predictors,  giving 


kp(n) 


-  ^0  ij(n-l))  Rp+1(n)  ^A^(n-l)  0^ 

+  ( 0  Bj(n-l)^  Rp+1(n)^0  jiJ(n)  -  AJ(n-l)J  0^ 


+  y°  5J< n-l)J  Rp+1(n)^0  |Ap(n)  -  AJ(n-l)J  0  j 

The  second  term  in  (3.8)  vanishes  upon  use  of  equations  (2.12)  and  (2.27), 
irrespective  of  (Ap(n)  -  Ap(n-1)),  and  substituting  (3.5)  (for  order  p+1) 
into  the  first  term  gives 


kp(n) 


v  (o  B*(n-1))  Rp+1(n-l)  (A*(n-1)  0^ 


(°  55<n“i))[xP+i(n)xp+i<n) '  wMxP+i(n-M)xp+i(n“M)J  (3-9) 


)T- 


X  I  A  . (n-1)  0 


Now,  equations  (2.16),  (2.21)  and  (2.31a)  show  that 


Vi(n_1)  lAp(n‘1) 


ee(n-l)  0T  k  (n 
P  P  P 


■l>)  • 


(3.10) 


and  it  follows  that 


k  (n)  -  wk  (n-1)  + 

P  P 


wkp(n-l)  ♦  ^0  B*(n-1)^  Xp+1(n)xJ+1(n)  (  AJ(n-l)  0^ 
-  wM  ^0  Bj(n-l))  Xp+1(n-M)xJ+1(n-M)  (AJ(n-l)  oj 


(3.11) 


Finally,  equations  (3.2)  and  (3.6)  provide  the  relationship 


k  (n)  *  wk  (n-1)  +  r  (n-1,  n-1)  e  (n,  n-1) 

P  P  P  P 


-  w  r  (n-M-1 ,  n-1)  e  (n-M,  n-1)  , 

P  P 


(3.12) 
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(3.17) 


R  (n)  =  wR  (n-1)  +  X  (n)XT(n)  -  w1^  (n-M)XT(n-M)  . 

P  P  P  P  P  P 


From  the  definition 


kp(n> 


(°  ®p<">)  %<“>  (Fp-l(n>  °j 


(2.59) 


a  time  update  is  obtained  by  differencing  F  . (n) ,  then  R  (n) ,  in  a  manner 

p-1  p 

identical  to  that  for  k  (n)  above.  Equations  (3.17)  and  (3.14)  then  give 

P 

k  (n)  =  wk  (n-1)  +  r  (n,  n)  e  . (n,  n-1) 

P  P  P  P“1 


-  w  r  (n-M,  n)  e  , (n-M,  n-1)  , 

P  P"1 


(3.18) 


or,  alternatively, 


k  (n)  =  wit  (n-1)  +  r  (n,  n-1)  e  ,  (n,  n) 

P  P  P  P~1 


-w r  (n-M,  n-1)  e  , (n-M,  n) 
P  P~1 


(3.19) 


(3.18)  and  (3.19)  show  that  only  two  types  of  equaliser  residual,  or"error”, 
need  be  carried  through  the  algorithm  -  either  (ep(n,  n-1),  e^in-M,  n-1))  ih 
(3. 18)  or  ( ep(n,n) ,  ep(n-M,  n))  in  (3.19);  the  necessary  lattice  residuals  are 
provided  in  either  case. 

The  complete  lattice  equaliser  algorithm  in  the  form  so  far  described  is 

set  out  explicitly  in  Appendix  1  for  an  equaliser  of  L  *  N  +  1  taps.  A  rough 

count  of  the  number  of  operations  involved  for  the  three  types  of  memory  is 

given  in  Table  1.  Here  divisions  are  counted  as  multiplications,  and 

2 

repeated  operations  (for  example,  computing  hp(n)  a  second  time)  are  omitted. 


»  t 


Algorithm 

Multiplications 

Additions 

Growing  memory 

13L  -  9 

9L  -  6 

Growing- fading  memory 

15L  -  9 

9L  -  6 

Sliding  window 

20L  -  14 

16L  -  10 

Table  1  Operation  count  of  algorithm  in  Appendix  1 
4  AUXILIARY  VARIABLES  AND  THE  FAST  KALMAN  ALGORITHM 

A  variant  of  the  algorithm  derived  in  the  last  two  sections  is  usually 
found  in  the  literature^  ^  ^ .  It  requires  fewer  residuals  to  update 

the  partial  correlation  functions,  but  involves  a  number  of  auxiliary 
variables.  The  auxiliary  vectors  and  scalars  will  be  derived  in  Section  4a, 
and  the  resulting  least-squares  lattice  algorithm  compared  with  that  of 
Appendix  1.  The  relations  derived  in  this  exercise  then  form  the  foundations 
for  the  so-called  "Fast  Kalman"  algorithm,  which  is  characterised  by  purely 
time-update  recursions,  and  is  described  in  Section  4b. 

(a)  Auxiliary  Vectors  and  Scalars 


It  was  noted  in  Section  3  that  the  general  update  for  k^(n)  in  equations 
(3.12)  and  (3.13)  involves  four  types  of  forward  and  backward  residual.  It 
is  seen  that  pairs  of  these  residuals,  eg  (ep(n,n-l),  ep(n,n))  differ  only  in 
their  second  time  argument,  associated  with  time  evolution  of  the  predictor 
coefficients.  These  residuals  may  therefore  be  related  by  means  of  an 
explicit  time  update  for  the  predictor  coefficients.  Such  an  update  is 
obtained  by  first  differencing  the  normal  equations  (2.21)  in  the  form 


R  (n)A  (n)  -vR  (n-l)A  (n-1) 
P  P  P  P 


* 
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(4.1) 


Taking  Che  lower  p  components  of  this  equation  (with  the  first  Rp(n) 
partitioned  as  in  (2.12)),  and  making  use  of  (3.2)  and  (3.5),  we  obtain  the 
relationship 


P-1 


(n-1)  ^Ap(°)  “  Ap(n-1)^ 


-  e  (n,n-l)X  . (n-1) 

P  P“1 

+  wMe  (n-M,n-l)X  . (n-M-1) 
P  P-1 


(4.2) 


Auxiliary  vectors  C  (n)  and  D  (n)  are  now  defined,  satisfying 
P  P 


R  (n)  C  (n) 


Dp(n) 


Xp(n) 


Xp(n-M)j  . 


(4.3) 


If  R  (n-1)  is  non-singular,  then  equation  (4.2)  may 

p-1 

R  , (n-1)  to  provide  a  time  recursion  for  the  forward 
p-I 

the  auxiliary  vectors. 


be  multiplied  by 
predictors  in  terms  of 


(4.4) 


Similarly,  differencing  the  backward  normal  equations  gives  a  time  recursion 
for  the  backward  predictors, 
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Bp(n)  - 


(4.5) 


If  auxiliary  scalar  variables  yp(n), 
equations 


6  (n),  a  (n)  are  now  defined  by  the 
P  P 


then  relations  between  (ep(n,  n-1),  ep(n-M,  n-1)  and  (ep(n,  n) ,  ep(n-M,  n-1)), 

T  T 

etc  result  from  pre-multiplication  of  (4.4)  and  (4.5)  by  Xp(n),  Xp(n-M): 


and 


Equations  (4.7)  allow  two  pairs  of  residuals  to  be  eliminated  from  the  lattice 
equaliser  recursions  upon  the  introduction  of  three  auxiliary  scalars. 
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In  order  to  complete  this  derivation  of  the  least-squares  lattice 

algorithm  it  is  necessary  to  make  use  of  the  following  order  recursions  for  the 
auxiliary  vectors.  /  u.,  x  i-fn.n) 


Cp(n) 


Dp(n) 


\  0  /  'p<”> 

fvl<n)N)  *  v-”- I 
\  „  /  «!(») 


(n) 


(4.8) 


(4.9) 


0  p 

These  may  be  readily  verified  by  multiplying  both  sides  of  the  equations  by 

the  covariance  matrix  R  (n)  partitioned  in  the  appropriate  manner. 

P 

Finally,  order  recursions  for  the  auxiliary  sqalars  are  obtained  by 

T  T 

pre-multiplying  equations  (4.8)  and  (4.9)  by  X^Cn),  X^Cn-M).  The  result  is 
as  follows. 


Yp(n) 

6p(n)  \ 

1 .  ( V1<") 

Vi(n) 

6p(n) 

y»)  / 

V  Vi(n> 

Vi(n) 

(4.10) 


Ep(n) 


(r^(n,n)  r  (n-M,n)r  (n,n)\ 

P  ?  | 

r  (n,n)r  (n-M,n)  r^(n-M,n)  > 

D  D  D 


In  the  present  context  (lattice  filters  employing  auxiliary  variables), 

equation  (4.10)  completes  the  alternative  formulation.  The  corresponding 

algorithm  is  given  explicitly  in  Appendix  2,  where  the  residuals  e^n.n), 

r  (n,n),  e  (n-M,n),  and  r  (n-M,n)  have  been  eliminated  (using  equations  (4.7)) 
P  P  P 

in  favour  of  the  set  (e  (n,n-l),r  (n,n-l),  e  (n-M,n-l),  r  (n-M,n-l). 

P  P  p  .  P 

A  count  of  operations  involved  is  given  in  Table  2. 
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Algorithm 

Multiplications 

Additions 

Growing  memory 

15L-11 

8L-5 

Growing-fading  memory 

17L-11 

8L-5 

Sliding  window 

34L-25 

21L-13 

Table  2.  Operation  Count  of  Algorithm  in  Appendix  2 


We  note  that 

(1)  the  reduction  in  the  number  of  residuals  achieved  by  the  introduction  of 
auxiliary  variables  applies  only  to  the  linear  prediction  portion  of 
the  equaliser  algorithm; 

(2)  the  saving  in  terms  of  number  of  variables  has  been  traded  for  a  decrease 
in  algorithm  efficiency  -  particularly  in  the  case  of  the  sliding  window. 

(b)  The  Fast  Kalman  Algorithm 

Order  recursions  for  auxiliary  vectors  are  the  essential  ingredients 
required  to  derive  the  Fast  Kalman  equaliser  algorithm  ^  7  which  is  also  a 
fast,  recursive  least-squares  method,  but  does  not  employ  a  lattice  structure; 
instead  it  provides  direct  time  updates  for  the  equaliser  coefficients  F^Cn) 
required  to  implement  an  adaptive  transversal  filter.  For  purposes  of  comparison 
with  the  least-squares  lattice  approach  we  present  a  derivation  of  the  Fast 
Kalman  algorithm  here. 

Differencing  equation  (2.56)  gives 

.  .  /V"’\  /V"-» 

y»V">  -  ■  Urt ;  -  "Up41 
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Z  (n-M,n-l)  ■  FT(n-l)X  (n-M)  -  a(n-M)  + FT(n-l)X  (n-M)  .  (4.14b) 

p  •  P  P  P  P 

To  obtain  updates  for  C  (n)  and  D  (n),  use  is  made  of  equations  (4.8) 

P  P 

and  (4.9)  together  with  the  alternative  order  recursions 


Cp(n) 


Dp(n)  « 


(  °  )' 
\cP-i(n-7 


e  (n,n) 

-E -  A  (n) 

Ep(n)  P 


e  (n-M,n) 

-E— -  A  (n) 

ce(n)  p 


(4.15) 


(4.16) 


which  may  be  verified  in  a  similar  manner.  These  relationships  (with  p 
replaced  by  p+1)  can  obviously  be  used  to  generate  direct  time  updates  for  the 
auxiliary  vectors,  provided  that  the  linear  prediction  quantities  involved 
(A,  B,  e,  r,  e)  can  also  be  updated.  (For  example,  equation  (4.15)  is  used  to 
form  C  . (n)  from  C  (n-1),  and  thus  equation  (4.8)  may,  in  principle,  be  used 

p+1  p 

to  extract  C  (n)  from  C  ^.(n)). 

P  P+r 

Assuming  knowledge  of  past  vectors  Ap+^(n-l),  Bp+1(n-l),  and  energies 

6  r  • 

e  ,(n-l),  e  ,(n-l),  the  time  update  (4.4)  is  used  in  the  form 
p+l  p+l 


AP+i(n)  *  Vi^-15  -  ep+i(n,n'1>cp(n_1) 


+  w  ep+1(n-M,n-l)Dp(n-l) 


(4.17) 


t 

I 

'J*'- 

if 


to  compute  Ap+^(n).  This  requires  forward  residuals  ep+^(n,n-l)  and 
ep+^(n-M,n-l)  to  have  been  found  from  Ap+^(n-l)  and  the  latest  data  vector 
using 
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i 
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(4.25) 


Vl(n)  "  Vl(n_1)  _  Vl(n>n'1)Cp(n)  +  wMrp+1(n-M,n-l)Dp(n)  , 


which  is  seen  to  depend  in  turn  on  C  (n) ,  D  (n) .  However,  substitution  of 

P  P 

Bp+1(n)  from  (4.23)  into  (4.25)  yields 


Vl(n)  l1  "  rp+l(n»n_1>  r 


Vl(n»n)  .  M 


r  . , (n-M,n) 

+  w‘r  .  (n-M,n-l) 


Vi(n) 


P+1 


Vl(n> 


=  Bp+1 (n-1)  -  rp+1(n,n-l)Mp+1(n)  ♦  wMrp+1 (n-M,n-l)Np+1 (n)  , 


ie 


Vi(n) 


(E 


Vi(n)  "  rp+i(n,n‘1)Vi(n)  +  w  Vi(n-M'n'1)Vi(n) 


P+1 


P+1' 


P+1' 


) 


(- 


rp+l(n,n-1)Sp  +  l(n)  +  w  Vl(n 


■M»I,-1)tp+l(n) ) 


,  (4.26) 


which  may  be  evaluated  after  the  backward  residuals  have  been  computed  from 


tp+l (n,n-l)  -  x(n-p-l)  +  Bp+1 (n-l)Xp (n) 


(4.27a) 


r  . (n-M,n-l)  »  x(n-M-p-l)  +  Bl  . (n-l)X  (n-M) 


P+1 


P+1 


(4.27b) 


The  auxiliary  vectors  C  (n) ,  D  (n)  can  now  be  obtained  from  equations  (4.23), 

P  P 

and  hence  the  equaliser  weights  from  (4.13).  The  complete  algorithm  is  given 
explicitly  in  Appendix  3. 

The  following  points  are  to  be  noted. 


(i)  The  Fast  Kalman  algorithm  provides  linear  prediction  residuals  ep+i  and 
rp+^  as  a  by-product  of  the  equalisation  computation;  in  particular,  it 
may  be  configured  as  three  transversal  filters,  executing  finite 
convolution  of  the  data  with  the  vectors  F  ,  A  ,,  and  B 

p  p+1  p+l 
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(ii)  The  algorithm  residuals,  e,  r,  and  e  are  arithmetically  equivalent  to 
corresponding  residuals  of  the  lattice  equaliser,  as  both  algorithms 
perform  exact  Recursive  Least  Squares  estimation. 

(iii)  The  algorithm  is  not  "expandable"  in  the  same  sense  as  that  employing 
a  lattice  configuration;  that  is  to  say  that  for  a  transversal  filter 
configuration,  increasing  the  number  of  weights  affects  all  previous 
stages;  this  may  give  rise  to  different  numerical  properties  between 
the  Fast  Kalman  and  lattice  algorithms. 

(iv)  Auxiliary  scalars  may  be  introduced  into  the  algorithm,  eliminating 

a  number  of  predictor  residuals,  in  a  manner  similar  to  that  in  Section  4a. 

(v)  As  in  the  case  of  the  lattice  equaliser,  the  data  vector  X  w(n),  as 

p+M 

well  as  the  sequence  (a(n),  a(n-l),  ....  a(n-M)X  must  be  stored  for 
the  sliding  window  estimator,  and  X^(n)  only  for  the  growing-memory 
estimator. 

(vi)  A  count  of  operations  involved  in  the  algorithm  given  in  Appendix  3 

is  provided  in  Table  3.  It  is  marginally  more  efficient  than  the  many- 
residual  lattice  equaliser  algorithm. 


Algorithm 

Multiplications 

Additions 

Growing  memory 

10L  +  3 

9L  +  2 

Growing-fading  memory 

10L  +  4 

9  L  +  2 

Sliding  window 

19L  +  6 

18L  +  4 

Table  3  Operation  Count  of  Fast  Kalman  Algorithm  in 
Appendix  3 
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5  COMMENTS  AND  CONCLUSIONS 


In  this  paper  we  have  derived  two  alternative  fast  algorithms  for  per¬ 
forming  exact  least-squares  linear  prediction  and  channel  equalisation.  The 
least-squares  lattice  algorithm  is  implemented  using  a  lattice  filter  structure 
whose  coefficients  are  evaluated  using  time  and  order  updates,  whereas  the 
Fast  Kalman  algorithm  is  implemented  using  a  transversal  filter  whose  coeffi¬ 
cients  are  determined  using  updates  in  time  only.  Both  of  the  algorithms 
incorporate  a  general  statistical  estimator  which  can  be  used  to  track  varia¬ 
tions  in  the  local  statistics  or  to  converge  to  an  assumed  stationary  value. 

It  includes  the  following  special  cases  which  merit  some  discussion. 

i)  When  M  is  infinite  and  w  *  1  all  data  points  from  zero  to  n  are  equally 
weighted.  We  call  this  the  "growing"  memory.  (The  data  pre-windowing 
condition  in  (3.2)  sets  the  lower  limit  of  summation  at  q  ■  0  in  (3.1)  and 
(3.4)). 

ii)  When  M  is  infinite  and  w  <  1  the  average  has  an  effective  memory  length 

of  -1/log  w  and  registers  predominantly  local  statistics.  We  call  this 
e 

the  "growing-fading"  memory. 

iii)  In  the  limit  w  *  1,  the  average  possesses  a  memory  of  M  time  units,  and 
hence  registers  only  local  statistics  within  that  interval.  We  refer  to 
this  as  the  "sliding  window"  memory. 

The  growing  memory  estimator  provides  exact  algorithms  arithmetically 
equivalent  to  a  Recursive  Least  Squares  procedure  and  is  suitable  for  obtaining 
a  fixed  parameter  filter  in  situations  where  the  statistics  of  the  input  data  are 
assumed  to  be  stationary.  The  adaptation  rate  decreases  with  data  running  time, 
and  the  filter  becomes  insensitive  to  changes  in  data  correlation;  it  is  there¬ 
fore  likely  to  prove  unsuitable  for  use  in  processing  long  data  sequences. 

In  addition,  the  forward  energies  (as  defined  above)  necessarily  increase  with 
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time,  although  overflow  may  be  avoided  fairly  simply  by  forming  averages 
involving  division  by  the  time  elapsed.  The  growing-fading  memory  and  sliding 
window  estimator  can  track  local  variations  in  data  statistics  and  may  be  used 
when  non-stationarity  is  assumed;  this  is  the  regime  for  which  the  generalised 
LD  recursion  is  most  appropriate.  The  growing-fading  memory  is  equivalent  to  an 
exponential  window,  while  the  sliding  window  has  a  fixed  memory  of  M  units, 
corresponding  to  a  rectangular  window  within  which  all  data  points  are  assigned 
an  equal  weight.  The  growing-fading  memory  algorithms  are  seen  to  be  far  more 
efficient  than  their  sliding  window  counterparts,  which  must  keep  track  of  the 
trailing  edge  of  the  window. 

The  actual  estimator  for  the  correlation  matrix  is 

n 

Vn)  =  (q)x*(q)  +  nwn+1i  i  ,  (5.1) 

v  q=n-M+l  v  v  v 

where  Ip  is  a  p  x  p  unit  matrix.  It  is  evident  that  the  effect  of  the  parameter 
n,  which  renders  the  algorithms  well-conditioned,  is  "forgotten"  by  the  growing 
estimator  after  times  beyond  the  memory  length,  but  is  retained  for  all  times 
by  the  sliding  window  estimator.  Although  the  conditioning  parameter  causes  a 
deviation  from  optimum  least-squares  adaptation  its  loss  of  influence  after  a 
given  time  may  be  considered  a  disadvantage,  especially  in  cases  where  the 
residual  energies  drop  below  the  available  computational  accuracy.  In  this 
instance  conditioning  may  have  to  be  reinstated  unless  the  sliding  window 
algorithm  is  employed.  In  either  estimator  accuracy  is  limited  by  residual 
statistical  fluctuations  associated  with  finite  time  averaging,  by  the  effect 
of  the  window  lagging  behind  the  present  data  input,  and  by  the  frequency  dis¬ 
crimination  which  arises  from  apodisation. 

Since  the  least-squares  lattice  and  Fast  Kalman  algorithms  compute  an 
exact  solution  to  the  same  least-squares  problem  they  must  be  arithmetically 


equivalent.  However  they  are  quite  different  in  structure  and  this  is 
fundamentally  associated  with  the  fact  that  the  equaliser  coefficients  in  the 
Fast  Kalman  algorithm  are  explicitly  updated  in  time  only,  whereas  the  coeffi¬ 
cient  in  the  least-squares  lattice  algorithm  are  updated  using  recursions  in 
time  and  order.  The  derivation  of  the  least-squares  lattice  algorithm  pre¬ 
sented  in  this  paper  highlights  this  distinction  by  avoiding  the  use  of  an 
explicit  time  update  for  the  predictor  coefficients  when  generating  a  time  up¬ 
date  for  the  reflection  coefficient  estimator.  This,  in  turn,  avoids  the  need 
to  introduce  auxiliary  vectors  and  scalars  into  the  least-squares  lattice 
analysis.  The  auxiliary  variables  are  required,  however,  in  the 
derivation  of  the  Fast  Kalman  algorithm  where  an  explicit  time  update  for  the 
equaliser  coefficients  is  essential  and  can  be  derived  efficiently  by  making 
use  of  the  time  and  order  recursive  properties  of  the  auxiliary  vectors, 
equations  (4.8),  (4.9),  (4.15)  and  (4.16).  It  is  interesting  to  note  that  all 
of  these  relationships  together  with  those  which  are  required  for  the  genera¬ 
lised  LD  recursion  can  be  derived  fairly  simply  by  using  the  following  covariance 
matrix  inversion  lemma 


A  (n)A*(n) 

— E E 

Ep(n) 


(5.2a) 


(5.2b) 


Equation  (5.2)  applies  to  the  specific  estimator  R^(n)  as  well  as  the  ensemble 
average  and  can  be  deduced  quite  simply  by  inverting  equations  (2.12)  and  (2.16) 
directly  and  making  use  of  the  normal  equations  (2.21)  and  (2.27). 
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The  Fast  Kalman  algorithm  has  been  shown  to  be  marginally  more  efficient 
(in  terms  of  arithmetic  operations)  than  the  least-squares  lattice  algorithm, 
but  is  is  reported  to  be  relatively  unstable  ^  The  least-squares  lattice 
algorithm  on  the  other  hand  has  a  number  of  desirable  numerical  properties 
which  arise  from  the  way  in  which  it  progressively  reduces  the  residual 
energy  in  a  stage-by-stage  manner.  The  internal  accuracy  or  wordlength 
requirements  for  this  type  of  calculation  are  generally  less  stringent,  and  the 
stage-by-stage  property  ensures  that  the  optimum  number  of  stages  can  be 
determined  by  inspection  as  the  calculation  progresses.  The  least-squares 
lattice  algorithm  is  likely, therefore,  to  have  superior  stability  properties. 
However,  the  stability  of  adaptive  algorithms  in  situations  of  non-stationary 
statistics  is  much  more  difficult  to  analyse  and  less  well  understood  than 
those  which  operate  in  a  stationary  environment.  For  example  in  the  non- 
stationary  case  the  stability  of  a  lattice  filter  is  not  determined  by  whether 
or  not  the  reflection  coefficients  have  magnitude  less  than  unity,  or  the 
poles  of  the  corresponding  transfer  function  lie  inside  the  unit  circle.  A 
detailed  investigation  of  the  stability  properties  of  these  algorithms  would  be 
extremely  valuable  and  is  likely  to  be  the  subject  of  much  further  research. 
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APPENDIX  1 


THE  LATTICE  EQUALISER  ALGORITHM 

1)  Define  the  scalars 

1)  ee(n),  er(n),  er(n-l);  e1(n),  r^n),  r^(n-l)  (i  =  a,  b,  c,  d)  ; 

P  P  P  P  P  P 

£p(n) ,  eb(n) ,  kp(n) ,  (p  «  0,  1,  . . .  N) 

{Comment:  Residuals  {e1(n)}  are  to  be  identified  with  those  of  Section3  in 

P 

the  following  manner,  (ea(n)  eb(n)  ec(n)  e^(n))  ■  (e  (n,n-l)  e  (n-M,n-l) 

P  P  P  P  .  P  P 

e  (n,n)  e  (n-M,n))f  with  a  similar  correspondence  for  the  (r1(n)},  and 
P  P  P 

(ea(n)  eb(n))  -  (e  (n,n-l)  e  (n-M,n-l)).} 

P  P  P  P 

V 

ii)  kp(n),  Lp+1(n)»  Kp+1(n)  (P  “  0,  1,  ...  N-l). 

2)  Initialise  the  algorithm  as  follows, 

i)  eVl)  -  er(-l)  -  2)  -  n  .  (p-0,  1,  ...N) 

P  P  P 


{Comment:  This  assignment  renders  the  initial  covariance  matrix  diagonal, 
with  entries  equal  to  n,  which  may  be  set  equal  to  a  small,  positive  value.} 

ii)  r1(-l)  (i  -  a,  b,  c,  d)  -  k  (-1)  -  k  (-1)  -  K  (-1)  -  L  (-1)  -  0  ,  (all  p) . 
P  P  P  P  P 

3)  For  each  successive  value  of  n,  starting  at  n  ■  0,  and  before  iteration 
over  p,  assign  the  following  values  to  p  ■  0  parameters. 


ea(n)  -  e^fn)  "  ra(n)  *  r^Cn)  * 
eb(n)  *  e^n)  *  rb(n)  “  *g(n)  "  *(n-M) 


with  x(n)  ■  0  for  n  <  0 
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S+l(n) 


Vi(n) 


Vi(n) 


k  (n) 

_ E _ 

e*(n) 


)  ■  •  v,  ■»(':“) 

/  \en(n)/  \rD(n-1)/ 


Cw*)  -  ^(m>)  * 

^rp+l(n)/  \ep(n)/ 


5)  Perform,  for  p  from  0  to  N,  the  equaliser  recursions 


®p(n) ' 


ep(n)> 


kp(n-l)  /#«> 

■>~l)  \r\ 


(n) 


k  (n)  -  wk(n-l)  +  r^(n)e®  . (n)  -  wMr^(n)e^  .  (n) 

P  P  P  P~1  P  P~1 


»  d  c 

The  output  of  the  predictor  may  be  taken  to  be  either  eN(n)  or  eN(n), 
and  that  of  the  equaliser  as  e*(n). 

The  growing-fading  memory  algorithm  is  obtained  by  setting  to  zero  the 
residuals  e^,  e^,  r\  r^  in  1)  to  5)  above.  (This  effectively  omits  any 
explicit  functions  of  M,  representing  the  lower  limit  of  the  window.)  The 
growing  memory  algorithm  may  then  be  obtained  by  setting  w  *  1. 

For  the  sliding  window  algorithm,  w  is  set  equal  to  unity  in  1)  to  5), 

and  all  residuals  retained.  (In  none  of  these  cases  does  multiplication  by 
M 

w  need  to  be  performed.)  This  last  algorithm  is  arithmetically  equivalent  to 
[17] 

that  of  Ahmed  et  al,  who  employed  a  Woodbury  inversion  lemma  to  update  the 
covariance  matrix. 
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sagmaaga. 


APPENDIX  2 


THE  AUXILIARY  VARIABLE  LATTICE  EQUALISER  ALGORITHM 

1)  Define  the  scalars 

1)  E*(n),  ^r(n).  e^(n-l);  ea(n),  eb(n),  ra(n),  rb(n),  ra(n-l),  rb(n-l), 

PPP  PPPPP  P 

ea(n),  eb(n) ,  kp(n),  (p  -  0,  1,  . . .  N) 

ii)  kp(n),  Lp+1(n),  Kp+1<n),  Yp(n),  Sp(n),  ap(n)  (p  *  0,  1,  ...  N-l) 

2)  Initialise  as  follows 

i)  Ep(-1)  -  ej(-l)  -  ep(-2)  -  n  (p  «  0,  1,  . . .  N) 

ii)  rVl)  -  rb(-l)  -  k  (-1)  -  k  (-1)  -  K  (-1)  -  L  (-1)  *  Y  (-D  -  «  (-1) 

PPPPPPPP 

=  a  (-1)  -  0  (all  p) 

P 

iii)  Retain  the  lowest-order  auxiliary  variables  for  all  n, 

Y.j^n)  ”  S^Cn)  “  o.jOO  *  0 

3)  For  each  successive  value  of  n,  starting  at  n  *  0,  and  before  iteration 
over  p,  assign  the  following  values  to  p  •  0  parameters 

ea(n)  -  ra(n)  -  x(n) 

with  x(n)  -  0  for  n  <  0 

eb(n)  -  rb(n)  -  x(n-M) 

e^n)  -  a(n)  ,  e^in)  “  a(n-M) 

£*(n)  -  £q(i>)  -  we*(n-l)  ♦  x2(n)  -wMx2(n-M) 
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For  each  value  of  n,  perform  the  following  iterations  from  p  from  0  to 


Ep*l<"> 

Vi(n) 


ep(n)  - 


eE(n-l) 


kpW 

ep(n“D 

kp<”> 

Ep(n) 


Vi(n) 


y»> 

ep<n-D 


K *>> 


kp(n> 


t 


Yp<n) 


6p(n) 


.  ,(n)  ♦  — ^ - |Yl  -  Y  ,  <n)\  r  *(n-l)  ♦  wM6  (n)rb(n-l)l 

P"1  .'(«)  L\  P  1  /  P  P-1  P  J 


i  (n)  +  — l (1  -  Y  |  (n)  J 

?1  <(n)L\  Pl  / 


r®(n-l)  +  wM6  , (n)rb(n-l) 
P  P  1  P 


] 


a  (n) 
P 


Vi(n)rP(n_1)  +  *  wMVi(n>)rp(n”l)] 

— - I  -  6  i  (n)re(n-l)  ♦  (l  +  wMa  (n)]rb(n-l) 

r(n)  L  P_1  P  V  P~l  )  P  J 


ap_l (n>  +  1 


5)  Perform,  for  p  from  0  to  N,  the  equaliser  recursions 


%(n)\  _  /V-l(n)\  kp(n-l)  /rp(tl)\ 

*p(n)/  '*p-l(n)'  Ep(n_1)  \rp(n)/ 


k  (n)  ■  wk  (n-1)  + 

P  P 


V.’“) 


l"Vi<n) 

Mx  x  \ 

w  6  .  (n) 

P~1 

wM6  . (n) 

P  1 

k(,  m 

-w  ll+w  a  , 

V  p-i 

CP 


Si 

The  predictor  output  is  eN(n),  and  the  equaliser  output  eN(n). 

Omit  eb,  rb,  eb  for  growing-fading  memory  algorithm;  in  addition,  set 

w  ■  1  for  growing  memory  algorithm.  Set  w  ■  1  and  retain  all  residuals  for 

M 

sliding  memory  algorithm.  (As  in  Appendix  2,  w  will  never  appear.) 

It  is  clearly  more  efficient  to  work  with  auxiliary  variables  (1  -  Yp(n) ) 
M 

and  (1  +w  a  (n))  instead  of  y  (n)  and  a  (n)  wherever  they  occur  in  1)  to  5) 

P  P  P 

above. 
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APPENDIX  3 


FAST  KALMAN  ALGORITHM 

1)  Define  the  (N  +  I)-vectors 

1)  F (n) ,  C(n) ,  D(n) ,  A(n) ,  B(n),  M(n) ,  N(n), 
and  the  scalars 

ii)  ea(n),  eb(n),  eC(n),  ed(n),  ra(n),  rb(n),  ea(n),  eb(n),  ee(n),  s(n),  t(n) 

{Comment:  these  variables  correspond  to  similar  expressions  in  Section  4,  for 
the  single  value  of  p  *  N}. 

2)  Initialise  as  follows 
ee(-l)  -  n 

A(-l)  -  C(-l)  -  D(-l)  -  B(-l)  -  F(-l)  -  0. 

3)  For  each  value  of  n,  starting  at  0,  perform  the  following  iterations. 
ea(n)  -  x(n)  +  AT(n-l)XN(n-l) 

eb(n)  *  x(n-M)  +  AT(n-l)X  (n-M-1) 

Si 

A(n)  »  A(n-l)  -  ea(n)C(n-l)  +  wM eb(n)D(n-l) 

{Comment:  This  recursion  is  equation  (4.4)  of  the  text.} 

eC(n)  -  x(n)  +  AT(n)X  (n-1) 

N 

ed(n)  -  x(n-M)  ♦  AT(n)X  (n-M-1) 

N 

c*(n)  -  w  e*(n-l)  ♦  e*(n)ec(n)  -  wM«b(n)ed(n) 
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{Comment:  This  is  the  forward  residual  energy  update  of  equation  (4.20).} 


/H(.)\  /-0  \  ^/a.\  > 

\s(n)7  \C(n-l)/  e  (n)  \A(n) / 

/  N(n)  \  _  /  0  \  +  ^  /  1  \ 

\V(nT/  yD(n-l)  J  £e(n)\A(n)/ 

ra(n)  -  x(n-N-l)  +  BT(n-l)XN<n)  , 

rb(n)  -  x(n-M-N-l)  +  BT(n-l)X  (n-H)  , 

N 

B(n)  *  ^B(n-l)  -  ra(n)M(n)  +  wM rb(n)N(n)^*  ^1  -  ra(n)s(n)  +  wM rb(n)t(n) 

C(n)  -  M(n)  -  s(n)B(n)  , 

D(n)  ■  N(n)  -  t(n)B(n) 

{Comment:  This  completes  the  prediction  filter  algorithm.} 

4)  ea(n)  -  a(n)  +  FT(n-l)XN<n)  , 

eb(n)  -  a(n-M)  +  FT(n-l)X  (n-M)  , 

N 

F(n)  ■  F(n-l)  -  ea(n)C(n)  +  wM eb(n)D(n) 

A  C 

The  prediction  filter  output  is  taken  to  be  e  (n)  or  e  (n) ,  and  that  of 
the  equaliser  as  e  (n). 

The  specialisations  for  each  of  the  growing,  growing-fading,  or  sliding 
window  algorithms  are  precisely  as  described  in  Appendix  1. 
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Figure  2 


Extension  of  lattice  filter  to  L-stage  equaliser  configuration 
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